Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation
Similar resources
Research Paper: Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation
Objective: Acquiring and representing biomedical knowledge is an increasingly important component of contemporary bioinformatics. A critical step of the process is to identify and retrieve relevant documents among the vast volume of modern biomedical literature efficiently. In the real world, many information retrieval tasks are difficult because of high data dimensionality and the lack of annot...
Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation
Objective: Acquiring and representing biomedical knowledge is an increasingly important component of contemporary bioinformatics. A critical step of the process is to identify and retrieve relevant documents among the vast volume of modern biomedical literature efficiently. In the real world, many information retrieval tasks are difficult because of high data dimensionality and the lack of anno...
Enhancing Text Representation for Classification Tasks with Semantic Graph Structures
To represent textual knowledge more expressively, a semantic-based graph structure is proposed that incorporates more semantic and ordering information among terms, as well as the structural information of the text. Such a model can be constructed by extracting representative terms from texts together with their mutual semantic relationships. Afterward, it is represented as a graph, wh...
Text Categorization with Semantic Commonsense Knowledge
Most text categorization research exploits the bag-of-words text representation. In this approach, however, all contextual information contained in the text is neglected. Therefore, capturing semantic similarity between text documents that share very little or even no vocabulary is not possible. In this paper we present an approach that combines well-established kernel text classifiers with external ...
Text Representation for Automatic Text Categorization
Automatic Text Categorization (ATC), the automatic assignment of text documents to predefined classes, is a language engineering task relevant to a number of applications, including automatic content and knowledge management in corporations and on the Internet, information access and filtering, etc. With the first works dating back to the 1960s [14], and increased work in the last decade (see the surv...
Journal
Journal title: Journal of the American Medical Informatics Association
Year: 2006
ISSN: 1067-5027,1527-974X
DOI: 10.1197/jamia.m2051